Sorting with the APPLE II 





The first of a series of articles which will deal with sort- 


ing in general and on the APPLE II in particular. This in- si we een ees 
stallment presents some background material, a com- Tolland, CT 06084 

parison of three sorting techniques, and a program for 

implementing the Sheil-Metzner sort. 

Whether you are maintaining complex Tabie | — Sorting in BASIC 

data bases, compiling mailing lists, or 

simply keeping track of your checkbook, SORTED WORDS 

at some time you will need to sort 

records. There are a multitude of sorts SORT 10 100 500 1000 
available — from the ນ slow 

one in the APPLE CHECK pro- 

Ora: through the relatively fast BASIC SHELL-METZNER 1 34 268 647 
sort, to my exceedingly fast (by a factor BUTTERFLY-HART 2 38 266 606 
of 200) machine language sort. What HEAPSORT i 35 261 600 


makes a sort fast, and which sort is the 
fastest? These are the questions | will 
cover in my series on exploring sorting 
with the APPLE į. 


Background 


There have been many magazine articles 
written on sorting. The ones | based my 
initial investigation on were those in the 
Nov-Dec 1976 issue of CREATIVE COM- 
PUTING covering the SHELL-METZNER, 
bubbie, delayed replacement, and heap- 
sorts, and the JAN-FEB 1978 issue of the 
same magazine on the Butterfly-Hart 
sort. The first article found the Sheli- 
Metzner and heapsorts to be a vast im- 
provement over the bubbie and delayed 
replacement sorts. The second article 
found the Butterfly-Hart to be even 
taster. The Sheil-Metzner and heapsort 
are replacement-type sorts; that is, the 
records are compared to one another 
and repiace each other according to 
some unique algorhythm. They are rela- 
tively small in size and don't rely on 
much extra storage for their processing. 
The Butterfiy-Hart is a linked list sort. A 
tree structure is built from the records 
and broken down into several smaller 
sorted lists. These lists are then merged 
to torm the final result. This sort is much 
faster for large numbers of records, but 
is quite complex and requires extra 
storage to hold the lists and tree struc- 
ture. For more details on how these 

operate, | leave you to refer to the 
orig sucios. 


I programmed each of these sorts in 
INTEGER BASIC and compared them by 
sorting various numbers of random ten 
character strings. Below were the 
resuits. 


S 


(All sorting times in seconds) 


For further exploration, i decided to use 
the Shell-Metzner sort because it was 
easiest to program and most compact. 
Many things had to be taken into ac- 
count before implementing this sort in 
INTEGER BASIC. Because of the limited 
string support in this BASIC, it is easier 
to store records to be sorted in memory 
between the upper end of the data 
variables and the lower end of the pro- 
gram area, accessing them with PEEK’s 
and POKE’s. At first, as | sorted these 
records, | exchanged the actual records 
in memory when necessary. This 
becomes very time consuming because 
for exchanging two 10 character records, 
you must move 30 bytes (10 to a work 
area, 10 from one record to the other, 
and 10 from the work area back to the 
other record). A much more elegant 
technique is to store the address of each 
record as a member of an array. When an 
exchange is necessary, you need only 
exchange the addresses in the array, a 
totai move of 6 bytes (2 + 2 + 2) for any 
size record. When the sort is complete, 
the addresses of the sorted records can 
be found sequentially in the array. The 
first member of the array will point to the 
lowest sorted record, and las 
member to the highest sort 


order quite simply, and can easi 
sorted in reverse order simply by readi 
the array backwards. The beauty of th 
method is tffgFfhe records have never ac- 
tually moved and can be read in the 
origina! order as simply as the gorted 
order. This reduction alone incr; the 















speed of the sort by a factor of three for 
a 100 record sort, and exponentially 
above that. 


My BASIC version is divided into several 
parts. The first part generates random 
character strings in memory, depending 
on the record size and count entered. 
This is for benchmark tests and can be 
replaced with your own VO routine for 
your application. Line 140 actually puts 
the random characters in memory, so 
replacing this line with a REM after your 
first run allows you to test other sorting 
methods while using the same records. 
The second part merely initializes the 
memory pointer array and prints the un- 
sorted strings. This can also be included 
in your VO routine. The third part is the 
actual SHELL-METZNER sort. The 
routine can be easily changed if you 
wish to sort numbers in an array instead 
of strings in memory. Finally, there is a 
routine to print the results, and a handy 
routine from CALL-APPLE for finding the 
address of a variable in the data area. 


SWEET-16 for Size 


ever being satisfied, | decided to con- 
inue another step and try to program the 
routine in SWEET-16 (as ali you 
APPLE people know, a 16 bit interpreter 
mplemented in ROM). An excellent ar- 
ticle in the NOV 1977 issue of BYTE 
(or the BEST OF BYTE VOL 1) was my 
source for SWEET-16 information. 
SWEET-16 was 4 to 9 times faster than 
the BASIC sort, and very compact due 


to the powerful instruction set. But due 
to difficulty in implementing, and be- 
cause the machine language routine was 
several orders of magnitude faster, | 
am not including this material. Don’t 
feel bad. Because | know of no SWEET- 
16 assembler, writing this program was 
actually harder than the machine 
language version. 


Machine Language for Speed 


The machine language Implementation 
of SHELL-METZNER was not difficult, 
because ! was almost translating direct- 
ly from each BASIC statement into 
equivalent functions in machine code. 
As you can see by the listing in Figure 2, 
| made extensive use of PAGE ZERO 
addressing, both to cut down on code 
and increase speed. | left in BASIC al! 
the 1/0 routines and setup necessary 
to prepare the sort, since this is quite 
easy in BASIC and | already had the pro- 
gram written from the first probiém. 
The actual sort algorhythm is the only 
part | programmed in machine code. 
Thus we get the benefit of BASIC for HO, 
printing, etc. in 1% of the execution, and 
the machine code speed for the repeta- 
tive Iooping in 99% of the execution. 

Using this machine language sort is rel- 
atively easy. The BASIC routine in Figure 


3 sets up the variables needed by the 
sort and calls the machine language 
routine. It can be substituted for the sort 
routine in the BASIC version in Figure 1 
(lines 1000-1900). The sort routine itself 
(in Figure 2) is loaded at address 300- 
3C2. This routine is easily relocatable 
to any other address (say 800 if you are 
using 300 for another routine). All you 
need to do is load it where desired and 
change the last two instructions (2 JMP 
commands) to reflect your new location. 
You must, of course, change the CALL 
in your BASIC program aiso. 

Below is a comparison of my three dif- 
ferent implementations of the Shell- 
Metzner sort. 


The maximum number of records you 
can sort is easily determined by taking 
the memory size between data high and 
program low and dividing it by the record 
size + 2 (the size of the array element 
needed to hold the pointer to the record). 
| find with a 32K machine running DOS, 
I have 18K free. More memory is avail- 
able if you want to lose DOS of course. 
Machine language routines may be more 
trouble to implement, but with an in- 
crease in speed over BASIC by 3 factor 
of 200, you cannot ignore them. 


u 


Table ii — Comparison of Three Methods 


SORTED WORDS X WORD LENGTH 


METHOD 500 X 10 
BASIC 268 
SWEET-16 46 
MACHINE 1 


1000 X 10 3600 X 3 
746 4200 (70 min) 
158 = 

3 21 


(All sorting times in seconds) 


Figure 1 


19 REM KKEKKKkKkEKkKEkKkEKKKKkKEKRKkRKEKEEKK K 


20 REM k 
30 REM k 


SHELL-METZNER SORT k 
BY GARY FOOTE k 


40 REM kkkkkkkkkkkkkkkkkkkkkkkkkkkk 


50 CALL -936:3 PRINT š 


60 INPUT "ENTER RECORD CO 


70 DIM A$(255) p A( NUM) 


80 IzJzKzLzMzXST=#zZ=LLz=II=LM 


PRINT 


90 LMF PEEK (208) + PEEK (205) 2563 HMs 32767 


PRINT "CREATING RANDOM STRINGS" 


95 REM 

199 REM kkkkkk FILL MEMORY WITH DATA kkkkkk 
105 REM 

110 PRINT š 

129 IF LMELENKNUMCHM THEN 140 


130 PRINT *TOO MUCH DATAL!": END 


140 FOR Xz1 TO LEN*NUM? POKE !.“ໆ+%, 


150 REM 


990 REM kkkkkk£ INITIALIZE MEMORY POIN 


205 REM 
210 AS="AS": 


GOSUB 4009 


220 FOR Xz1 YO NUMEA(X) 2(Xo1) eLENtLM+D 


230 
240 NEXT X 
250 REM 


T2A(X)$ GOSUB 3000 


161 


RND (26! +193: 


"SHELL -METZNER SORT": PRINT 
UNT AND LENGTH", NUM, LEN 


ZHMZADDRzW ZN? REM SAVE SPACE FOR VARIABLES 


NEXT X 


TER ARRAY kkkkkk 


1909 
1019 
1100 
1200 
1309 
1400 
1590 
1609 


1700 
1809 
1909 
1910 
2099 
2095 
2019 
2020 
2030 
2040 
2059 
2069 
3900 
3095 
3019 
3920 
3030 
3040 
3950 
3069 
4090 
4005 
4019 


4020 
4039 
4940 


4059 


1099 
1019 
1100 
1200 
1309 
1400 
1500 


1600 
1700 


REM 6ແແແແແແ SORT ROUTINE ekeKen 
REM 

PRINT : PRINT "STARTING SORT" 

NENUMSM=N 

M=M/23 IF M=0 THEN 1900; ເວໄ-າ: J21 
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LZIrFMIIIZAC(I)ILLZA(L) 

FOR X#0 TO LENeliws PEEK (11#%) - PEEK (LL+X)! IF wen THEN 18003: IF 
W>0 THEN 1700: NEXT X: GOTO 1800 

TACT) SAT) SACL) SA(L) Tey zt -M IF I>21 THEN 1500 

J=Jti? IF J>K THEN 1309! GOTO 1409 

PRINT $ PRINT "ENDING SORT" 

REM 

REM kkkkkk PRINT RESULTS kkkkkk 

REM 

AS="AS"3 GOSUB 40090 

FOR X=1 TO NUM 

TCA(X) 3 GOSUB 3009 

NEXT X 

END 

REM 

REM kkkkkk STRING PRINT ROUTINE kkkkkk 

REM 

FOR 2=0 TO LEN-1 

POKE ADDR+Z» PEEK (TZ)! REM ARRAY A$ 

NEXT 2: POKE ADDR+2+-30 

PRINT Xs A$ 

RETURN 

REM 

REM kkkkk FIND VARIABLE'S ADDRESS 

REM 

ADDR= PEEK (74)+ PEEK (75) «256*12K= LEN(AS)3JUz PEEK (204)+ PEEK (205 
,*2564«18L50: IF AS(KeK) "S$" THEN %020:ແ2ແ-1:( ວາ 

FOR Y=. TO K: IF ASC(AS(I))# PEEK (ADDR+I) THEN 4040: NEXT I 

IF PEEK (ADDR+I+L)>2 THEN 406403 ADDREADDREKFHGEtLS RETURN 

FOR Iz=1 TO 2109: JF PEEK (I+ADOR)>1 THEN NEXT TS IZADDOR+I+12ADDR= PEEK 
(T)+ PEEK (3+1)*«256-) 

IF ADDR<J THEN 4020: PRINT "VARIABLE ")A$S;" NOT FOUND": END 


Figure 2 


REM #ແແແແ%ແ SORT ROUTINE kkkkkk 
REM 

PRINT ; PRINT "STARTING SORT" 

AS="A"s GOSUB 40090 

POKE 0, ADDR MOD 256: POKE 1/ADDR/256: REM STORE ARRAY ADDRESS 

POKE 2eLEN: REM STORE RECORD LENGTH (MUST BE ເ 256) 

POKE GeNUM MOD 256: POKE 5eNUM/2563 POKE 6*NUM MOD 256: POKE 7,NUM/ 
256: REM STORE NUMBER OF RECORDS 

CALL 768: REM CALL SORT ROUTINE 

PRINT š PRINT "ENDING SORT” 
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0309- 
0302- 
9304- 
0396- 
0306+ 
050ຕໍ+ 
0308“ 
030Ce 
030E» 
0310- 
0312” 
03214» 
0316- 
0318- 
031A» 
031C- 
031E- 
0320> 
0322 ~ 
0324- 
0526” 


46 
66 
00 
AS 
DO 
60 
38 
A5 
ES 
85 
A5 
ES 
85 
A9 
85 
AS 
85 
A5 
85 
AS 
85 


07. 


96 
05 
07 
91 


04 
06 
OE 
05 
07 
OF 
01 


09 
00 
oc 
08 
0D 
09 


1990 
1010 
1020 
1039 
1040 
1950 
1060 
1070 
1980 
1090 
1190 
1110 
1120 
1130 
1140 
1150 
1169 
1179 
1180 
1190 
1209 
1210 
1229 
1230 
1240 
1250 
1260 
1270 
1280 
1299 
1309 
1510 
1320 
1330 
1340 
1350 
1360 
1370 
1380 
1390 
1409 
1410 
1420 
1430 
1440 
1450 
14690 
1470 
1480 
1490 
1509 
1510 


Figure 3 


Kenran nnn emer ern ແາ 5າາແາຈາແຈຈ<ຈ = ຈ ພ = - k 
k SHELL -METZNER SORT k 
k BY GARY A. FOOTE k 
k COPYRIGHT 1979 k 
k COMMERCIAL RIGHTS RESERVED kœ 
Kemwnetre nn ene ne s... nwezen anes ewan - K 
* 

k VARIABLES AND CONSTANTS 

ໄ່ 

k ALL VARIABLES ARE TWO BYTES, 


k THE LISTED NAME IS THE LOW ORDER BYTE. 
k THE NAME+1 IS THE HIGH ORDER BYTE, 

k EX, I > LOW ORDER BYTE 

k T+1 = HIGH ORDER BYTE 

k 

ADRA EQ $00 
LEN .EQ@ $02 


ARRAY A ADDRESS 
RECORD LENGTH 


N „EQ $04 NUM OF RECORDS 

M EQ $06 M 

I EQ 508 I (RECORD I) 

L „EQ 5042 L (RECORD L) 

J EQ 506 J 

K „EQ $0E K 

PTRI EQ $10 PTR TO ADDR OF ACT) 


PTR TO ADDR OF (ໄ) 
ADDR OF REC A(T) 
ADDR OF REC A(L) 


PTRL ,EQ $12 
ADRI ,EQ $14 
ADRL EQ $16 
k 


k SORT ROUTINE 


„OR $309 
SORT LSR Mti M = M /⁄ 2 
ROR M 
BNE SRT1 IF M z 0 
LDA Mr1 
BNE SRT1 THEN 
RTS DONE! 
SRT1 SEC 
LOA N K = N = M 
SBC M 
STA K 
LDA Neti 
SBC Mr1 
STA Kei 
LDA #1 J = 1 
STA J 
LDA #0 
STA Jtl 
SRT2 LDA J 
STA I 
LDA Jel 
STA Iti 
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0328- 
03296 
032B = 
032D- 
052 - 
0531 + 
03336 
0335¢ 
0557" 
02396 
0338- 
033D- 
053F - 
0341- 
0343- 
03%% - 
0346-2 
0348» 
OIGA» 
034C- 
034E = 
0350- 
0351- 
0353- 
0355% 
0357 
0359“ 
0358” 
0350“ 
035E» 
0360» 
0362- 
0364- 
0366“ 
0568“ 
0369- 
0 36B- 
036D- 
036F = 
03710 
0372- 
0374- 
0376» 
0378- 
0374” 
0378“ 
0370“. 
037 - 
0381 
0383-6 
0385-6 
0387- 
0389- 
0388- 
038Ce 


08 
06 
OA 
09 
07 
0B 
09 
10 
12 
01 
1) 
15 
02 


10 
08 
10 
11 
09 
11 


12 
OA 
12 
13 
0B 
13 


E3 
10 
14 
12 
16 


10 
15 
12 
17 


14 
16 
31 
07 


02 
F3 
28 
00 
14 
12 
16 
10 


15 


1520 SRT3 CLC 


1530 
1540 
1550 
1560 
1570 
1580 
1590 
1600 
1610 
1620 
1630 
1640 
1650 
1660 
1670 
1680 
1690 
1700 
1710 
1720 


SRT 


1730 


1740 
1750 
1760 
1770 
1789 
1790 
1800 
1810 
1820 
1830 
1840 
1850 
1860 
1870 
1880 
1890 
1900 
1910 
1920 
1930 
1940 
1950 
1960 
1970 
1980 
1990 
2000 
2010 
2020 
2030 
2040 
2050 
2060 


SRT5 


SRT6 


LDA 
ADC 
STA 
LDA 
ADC 
STA 
LDA 
STA 
STA 
LDA 
STA 
STA 
LDY 
CLC 
LDA 


I 

M 

L 

Tel 
Mt} 
L+i 
ADRA 
PRI 
PTRL 
ADRAt1 
PTRIrF1 
ກ +) 
#2 


PTRI 


AOC 1 


STA 
LDA 
ADC 
STA 
CLC 
LDA 
ADC 
STA 
LDA 
ADC 
STA 
DEY 
BNE 
LDA 
STA 
LDA 
STA 
INY 
LDA 
STA 
LDA 
STA 
DEY 
LDA 
CMP 
BCC 
BNE 
INY 
ເດກ 
BNE 
BES 
LOY 
LDA 
STA 
LDA 
STA 
INY 
LDA 


PTRI 
PTRI+1 
1) 
PYRI +1 


PTRL 

L 
PTRL 
PTRL+1 
Lei 
PTRL+1 


SRT4 
(PTRI) +Y 
ADRI 
(PTRL)» Y 
ADRL 


(PTRI) Y 
ADRI +1 
(PTRL) , Y 
ADRL $1 


(ADRI) + Y 
(ADRL) Y 
SRTB 
SRT6 


LEN 

SRT5 
SRT8 

80 

ADRI 
(PTRLI Y 
ADRL 
(PTRI) Y 


ADRI+) 
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INITIALIZE PTRS 
TO ARRAY A 
ADDRESS 


PTR TO A(T) = 
ADDR ARRAY A + 
ekx 


PTR TO ACL) = 
ADDR ARRAY A + 
2, 


DO 2 TIMES 
(PTR DISP IS 2 BYTES) 
IJ 2 AlI) 


ACL) 


LL 2 


COMPARE ONE BYTE IN 
RECORDS 1 6 L 


I < L 

1 > L 

I š L 

END OF RECORD? 
NO» NEXT BYTE 

RECORDS EQUAL 

ACT) ແ? ACL) 


038ແ- 91 12 
0390- AS 17 
0392 - 91 10 
| 0394- 38 
| 0395“ AS 08 
| 0 397 - E5 96 
| 0390,” 85 08 
| 1398- A5 09 
039D- E5 97 
0395 - 85 09 
| 0341" 90 06 
| 0343 - DO 83 
| 0365" AS 08 
| 03A7- DO FA 
03A9- E6 OC 
| 03AB- DO 02 
O3AD-= E6 9D 
O3AF> AS OF 
0381 - C5 0D 
0383- 90 98 
0385“ DO 06 
03B7=- AS OE 
ງ 399 - C5 0C 
J3BB- 90 93 
33BD= ແແ 20 03 
93COe 4C 00 03 
£$300.3C2 
9300" 46 07 66 06 DO 
0308e DO 01 60 38 A5 
0319+ 85 DE AS 05 E5 
0318- A9 01 85 OC A9 
0320» A5 OC 85 08 AS 
0328+ 18 AS 08 65 06 
0339- 09 65 07 85 08 
0338- 10 85 12 A5 01 
03%0- 13 A0 02 18 AS 
03486 85 10 AS 11 65 
0350- 18 A5 12 65 OA 
0358” 13 65 9B 85 13 
0369r 81 10 85 14 B1 
0368" C8 81 10 85 15 
0370. 1? 83 81 14 Dl 
0378» DO 07 CB C4 02 
03890- 28 A0 00 AS 14 
0388- 16 91 19 CB AS 
03902 A5 17 91 10 38 
0398 - 06 85 98 A5 09 
03A02 09 90 96 DO 83 
0348“ FA E6 06 DO 02 
0380- OF C5 00 90 OB 
0388» 0E 
0369 - 4c ÍO 


2079 
2080 
2090 
2109 
2110 
2120 
2130 
2140 
2150 
2169 
2170 


2180 SRT7 


2190 
2200 


2210 SRT8 


2220 
2239 
2240 
2259 
2260 
2270 
2280 
2290 
2509 


SRT9 


2310 JMP1 
2320 JMP2 


2330 


05 85 
04 ES 
07 85 
.00 85 
00 95 
85 OA 
45 00 
85 11 
10 85 
09 85 
85 12 
88 00 
12 85 
81 12 
16 90 
DO F3 
91 12 
15 91 
A5 08 
E5 07 
A5 08 
E6 00 


07 
06 
OF 
00 
09 
AS 
85 
85 
08 
11 
A5 
E3 
16 
85 
ຽ1 
FO 
A5 
12 
E5 
85 
DO 
A5 


DO 06 AS 
C5 0C 90 03 4c [20 03 


STA ເວກ), 
LDA ADRL+1 
STA (PTRI) Y 


SEC 

LDA I 
SBC M 
STA I 
LDA Trl 
SBC Mtl 
STA 1] 
BCC SRT8 
BNE SRT3 
LDA I 
BNE SRT7 
INC J 
BNE SRT9 
INC Ji 
LDA K+1 
CMP Jeti 
BCC J MP2 
BNE JMP1 
LDA K 
CMP J 
BCC JMP2 


JMP 
JMP 
eEN 


SRT2 
SORT 


IF I > 9 THEN STRS 


າ 


IF J > K 


THEN SORT 
ELSE SRT2 


CHANGE IF RELOCATED 
CHANGE IF RELOCATED 


SYMBOL TABLE 


ADRA 
M 

J 

PTRL 
SORT 
SRT3 
SRT6 
SRT9 


CHANGE IF RELOCATED 
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0999 
9906 
000C 
0012 
0399 
0328 
0381 
O3AF 


LEN 0992 N 

1 0098 L 

K 009E PTRI 
ADRI 0014 ADRL 
SRT1 0398 SRT2 
SRTü 9343 SRT5 
SRT7 93A3 SRT4S 
ປາວ] 0380 JMP2 


9994 
090A 
0010 
0916 
0320 
0372 
0369 
0360 


